Device for tracking objects in a video stream

ABSTRACT

The invention discloses a device for tracking objects in a video stream comprising a processing unit in turn including a selection module configured for selecting video portions comprising sequences of a predetermined object, from the video stream, a module for tracking positions of the predetermined object in the video portions, a module for creating object's video streams, configured for creation of partial video streams as a function of the tracked positions, each said partial video stream representing one said tracked object, the device further comprising a composition module configured for composing the partial video streams of the individual objects into an output video stream.
The invention further discloses a transmission system comprising the device for tracking objects.

FIELD OF THE INVENTION

The present invention relates to a device for tracking objects in a video stream. In particular, although not exclusively, the objects can be represented by buyable and/or selectable goods from a web site, for example, and the video stream can be that shown on a computer. Throughout the present specification reference will be made to these particular instances for clarity of exposition.

KNOWN ART

It is well known that the web makes available many e-commerce sites. Likewise, many other sites allow browsing between objects of interest, enabling selection thereof so as to evaluate the particular features of same.

Usually, a user carries out browsing between objects of interest utilizing search engines. These engines presently represent the main way for identifying the contents of interest for the user.

A consumer user who is willing to acquire or examine a product/object on a web site has the possibility of selecting this product from a list or through a corresponding photo, or an icon representative of the object.

For buying it, the consumer user utilizes known equipment of electronic commerce technologies.

The selectable objects are represented in a static manner, often taken out of the environment in which they can be or are used.

Referring particularly to objects such as garments, the impossibility of showing the effect of the garment when worn and how it should be worn, as well as the environment for which it is best suited, often represents a deterrent rather than an encouragement to buying it. Generally, the selection of objects available on the web is linked to a presentation of same that is necessarily static and stiff, inevitably neglecting part of the graphic, technical and functional features of said objects and causing a drop in the consumer's interest.

In this context, the technical task underlying the present invention is to propose a device for tracking objects in a video stream capable of overcoming the drawbacks of the above mentioned known art.

Specifically, it is an object of the present invention to provide a device for tracking objects in a video stream capable of improving the process of selecting said objects.

Another aim of the invention is to enable a selection of objects that are dynamically represented.

It is a further aim to enable a specific display of the functional qualities and/or features of the represented objects.

SUMMARY OF THE INVENTION

The technical task mentioned and the aims specified are substantially achieved by a device for tracking objects in a video stream, comprising the technical features set out in one or more of the appended claims.

The invention offers the following advantages:

-   simplified selection of the object;
-   consequent simplification in buying the selected object;
-   better display of the object's features;
-   greater promotional effect on the consumer.

Further features and advantages of the present invention will be better clarified by the description given by way of non-limiting example of a preferred but not exclusive embodiment of a device for tracking objects in a video stream as shown in the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of the device of the invention;

FIG. 2 is a detail of one block of the diagram shown in FIG. 1;

FIG. 3 shows a block diagram of a transmission system comprising the device of the invention.

DETAILED DESCRIPTION

A device for tracking objects in a video stream comprises a processing unit that in turn includes a plurality of modules configured for converting the incoming simple video stream into an outgoing video stream containing information on the shown objects. Preferably, this information is meta-information in the video stream, the term meta-information indicating not only tags but also an assembly of tags with other information added thereto.

In particular, with reference to FIG. 1, a device 1 for tracking objects in a video stream S_(IN) is shown. Preferably, the device 1 comprises a conversion module 20 configured for converting a Source video stream into the video stream S_(IN).

In other words, the conversion module 20 converts a Source stream into the format of the video stream S_(IN) required for subsequent processing.

The objects will be denoted in the specification by the identifying letters “Oi”, which will represent the i-th object in an assembly of n objects shown in the video stream S_(IN).

Preferably, the video stream S_(IN) is a fashion film and the objects Oi comprise garments shown for advertising purposes in this film.

Alternatively or in addition, the video stream S_(IN) can be a film in which objects of various kinds are shown, such as a car, a building or a tree, for example. Alternatively or in addition, the object can be a reference to a service; for instance, an object referring to a hairdresser, a bookshop, a shop or the like can appear on a building in a video.

Generally, the video stream S_(IN) can be a film in which the objects are merely shown as part of an environment without any reference to the possibility of buying them.

Generally, the objects Oi can be any selectable or saleable product.

Preferably, device 1 comprises a processing unit 10. Generally it should be noted that in the present context and the following claims, the tracking device 1 will be shown as divided into distinct storage modules and operating modules for the sole purpose of describing the device's functional operations in a clear and full manner.

These operations can be performed by a single electronic device suitably programmed and the different modules can correspond to hardware components and/or software routines being part of the programmed device. Alternatively or in addition, these operations can be performed by a plurality of electronic devices among which the modules can be distributed.

In addition, the involved devices can take advantage of one or more processors for execution of the instructions contained in the memory.

Further, the storage and operating modules can be locally or remotely distributed among different computers, based on the architecture of the network in which they reside.

With reference to FIG. 1, the processing unit 10 in turn comprises a selection module 13 configured for selecting video portions V comprising sequences of a predetermined object Oi, from the video stream S_(IN).

The composition of the video portions V relating to each individual object determines a video representing the object's life cycle, from the stage at which it appears to the stage at which it leaves.
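The selection of video portions V can be illustrated with a minimal sketch. It assumes, purely for illustration, that the presence of an object Oi is already known frame by frame as a list of booleans; the function name `select_portions` and this input representation are hypothetical and are not taken from the specification.

```python
from typing import List, Tuple


def select_portions(present: List[bool]) -> List[Tuple[int, int]]:
    """Return inclusive (first_frame, last_frame) runs in which the object is visible.

    `present[k]` is assumed to be True when the object appears in frame k.
    """
    portions: List[Tuple[int, int]] = []
    start = None  # first frame of the run currently being scanned, if any
    for frame, visible in enumerate(present):
        if visible and start is None:
            start = frame  # the object appears
        elif not visible and start is not None:
            portions.append((start, frame - 1))  # the object left on the previous frame
            start = None
    if start is not None:
        # the object is still visible when the stream ends
        portions.append((start, len(present) - 1))
    return portions
```

Each returned pair marks one portion V; taken together, the pairs span the life cycle of the object from its appearance to its departure.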

According to the invention, the video stream can be that shown on a computer and/or a television, and/or a mobile apparatus such as a mobile phone or an iPad. Alternatively or in addition, the video stream can be shown on any other display means in which an interaction therewith is possible, through a mouse device, a touch screen or the like, for example.

In an embodiment of the invention, the video stream S_(IN) is a fashion film.

In this particular example the objects Oi are, for instance, a shirt worn by a model, a scarf, a pair of shoes, a skirt, etc.; the selection module 13 selects the portions of the film in which at least the selected shirt, or alternatively the skirt, or alternatively the scarf, etc., is present.

Alternatively, the video stream can also be a soap opera if it reproduces tagged objects, displayed by means of a decoder of a web TV or an IPTV operator, for example.

Preferably, the film shows a rather high concentration of objects to be advertised.

The processing unit 10 further comprises a tracking module 14 configured for tracking positions P of each predetermined object Oi in the selected video portions V.

In other words, the tracking module 14 determines the edges of each object Oi at each position P in the video portions V relating to the specific object Oi selected. Preferably, the positions P are defined by an operator.

Alternatively, a suitable program generates these positions.

The tracking module 14 comprises an interface 17 for setting references Ri representative of each predetermined object Oi in the video portions V.

In other words, through the interface 17 an operator sets reference points representative of an object Oi in a given position P, for instance points that, through interpolation, can represent a shirt, a skirt, etc.

An association module 18 included in the tracking module 14 is configured for associating predetermined polygonal shapes Gi with the object Oi in positions P, starting from references Ri.

Preferably, this association module 18 comprises a known linear approximation algorithm that interpolates the sequences of references Ri and compares them with the predetermined polygonal shapes Gi.
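The interpolation step can be sketched as follows. The specification does not name the algorithm, so this sketch assumes plain linear interpolation between reference points set by the operator on a few key frames; the function `interpolate_references`, the `Point` alias and the key-frame dictionary are hypothetical names introduced only for this example, and the comparison with the polygonal shapes Gi is not covered.

```python
from typing import Dict, List, Tuple

Point = Tuple[float, float]  # (x, y) in pixel coordinates; an assumption of this sketch


def interpolate_references(keyframes: Dict[int, List[Point]], frame: int) -> List[Point]:
    """Linearly interpolate the reference points Ri for `frame`.

    `keyframes` maps a frame number to the reference points set for that frame.
    Every key frame is assumed to list the same points in the same order.
    """
    if not keyframes:
        raise ValueError("at least one key frame is required")
    frames = sorted(keyframes)
    # Outside the key-frame range, hold the nearest set of references.
    if frame <= frames[0]:
        return list(keyframes[frames[0]])
    if frame >= frames[-1]:
        return list(keyframes[frames[-1]])
    # Otherwise blend the two key frames that surround `frame`.
    for lo, hi in zip(frames, frames[1:]):
        if lo <= frame <= hi:
            t = (frame - lo) / (hi - lo)
            return [
                (x0 + t * (x1 - x0), y0 + t * (y1 - y0))
                for (x0, y0), (x1, y1) in zip(keyframes[lo], keyframes[hi])
            ]
    raise ValueError("frame not covered by key frames")  # defensive; not reachable
```

With references set only on frames 0 and 10, a call for frame 5 returns the midpoints, so the operator does not have to mark every frame by hand.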

Preferably, a storage module 22 is associated with the association module 18 and comprises a library of predetermined polygonal shapes Gi.

In the preferred embodiment this library will comprise polygons representative of a shirt, a skirt, shoes, etc.

After the objects Oi have been tracked in positions P, partial video streams are created for each individual object represented.

To this aim, the processing unit 10 comprises a module 15 for creating object's video streams configured for creation of partial video streams S_(i) as a function of the tracked positions P.

Each partial video stream S_(i) represents an object Oi tracked in positions P.

A first display module 19 configured for showing the partial video streams S_(i) enables an operator to verify the quality of the created video stream S_(i).

Should the operator deem the quality unsatisfactory, new positions P are defined for the object, interpolation is carried out again and a new partial video stream S_(i) is created.

According to the invention, once the partial video streams S_(i) are created, they are composed again into a single video stream.

To this aim, the device 1 comprises a composition module 16 configured for composing the partial video streams S_(i) of the individual objects Oi into an output video stream S_(OUT).

In other words, the output video stream S_(OUT) comprises the partial video streams S_(i), together with the tracked positions P through which each object Oi passes from the moment it appears in the video stream to the moment it leaves the video stream.
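The composition step can be pictured with a minimal sketch. It assumes, for illustration only, that each partial stream S_(i) is reduced to a mapping from frame number to a bounding box for the tracked object; the function `compose_streams`, the `Box` alias and the object identifiers are hypothetical and do not come from the specification.

```python
from typing import Dict, List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height); an assumption of this sketch


def compose_streams(partial: Dict[str, Dict[int, Box]]) -> Dict[int, List[Tuple[str, Box]]]:
    """Merge per-object tracks into one per-frame index of tracked objects.

    `partial` maps an object identifier to its track, i.e. frame number -> box.
    The result maps each frame number to the objects visible in that frame.
    """
    output: Dict[int, List[Tuple[str, Box]]] = {}
    for object_id, track in partial.items():
        for frame, box in track.items():
            output.setdefault(frame, []).append((object_id, box))
    # Order frames chronologically and objects alphabetically for stable output.
    return {frame: sorted(output[frame]) for frame in sorted(output)}
```

A frame in which two objects are tracked yields two entries, so a player reading the composed index can test a click against every object visible at that instant.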

Device 1 comprises a conversion module 21 configured for converting the output video stream S_(OUT) into meta-information.

Preferably, this meta-information comprises meta-data in the required format for the web.

Preferably, the requested format is XML.
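A conversion into XML meta-information might look like the sketch below. The specification states only that the format is XML; the element and attribute names used here (`metaInformation`, `object`, `position`) and the function `to_meta_information` are hypothetical, chosen for illustration, and the tracks are assumed to be frame-to-bounding-box mappings.

```python
import xml.etree.ElementTree as ET
from typing import Dict, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height); an assumption of this sketch


def to_meta_information(tracks: Dict[str, Dict[int, Box]]) -> str:
    """Serialize tracked positions as an XML string (hypothetical schema)."""
    root = ET.Element("metaInformation")
    for object_id, track in tracks.items():
        obj = ET.SubElement(root, "object", id=object_id)
        for frame in sorted(track):
            x, y, w, h = track[frame]
            # One <position> element per tracked frame of this object.
            ET.SubElement(
                obj, "position",
                frame=str(frame), x=str(x), y=str(y), width=str(w), height=str(h),
            )
    return ET.tostring(root, encoding="unicode")
```

The resulting string can be parsed back with `ET.fromstring`, which is a simple way to check that a destination device would receive well-formed meta-information.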

A user interface 23 of this device 1 is configured for showing the output video stream S_(OUT) depending on a request mark signal for tracking objects Oi.

In other words, the user interface 23 allows the consumer, through the mark signal, to select whether or not to see the video stream S_(OUT) with the tracked objects.

The user interface 23 further comprises a second display module 24 configured for showing features of the objects Oi depending on an input signal I.

Signal I is representative of a particular choice regarding displaying and/or features and/or additional functions of said objects Oi.

In the preferred embodiments, signal I is representative of one of the choices of displaying:

-   information on the product;
-   information on the brand;
-   correlated products;
-   addition to a wish list.

Generally, signal I is representative of many different effects of man/video interaction.
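The handling of the input signal I can be sketched as a simple dispatch over the four choices listed above. The enumeration `Choice`, the function `handle_signal` and the dictionary layout of an object record are hypothetical and serve only to illustrate the idea.

```python
from enum import Enum
from typing import Dict, List


class Choice(Enum):
    """The four display choices that signal I can represent."""
    PRODUCT_INFO = "product"
    BRAND_INFO = "brand"
    CORRELATED_PRODUCTS = "correlated"
    ADD_TO_WISH_LIST = "wish_list"


def handle_signal(choice: Choice, obj: Dict[str, object], wish_list: List[str]) -> object:
    """Return what the display module would show for the given choice.

    `obj` is an assumed record with the keys "id", "product", "brand" and "correlated".
    """
    if choice is Choice.PRODUCT_INFO:
        return obj["product"]
    if choice is Choice.BRAND_INFO:
        return obj["brand"]
    if choice is Choice.CORRELATED_PRODUCTS:
        return obj["correlated"]
    # Remaining case: remember the object in the user's wish list.
    wish_list.append(str(obj["id"]))
    return list(wish_list)
```

Further effects of man/video interaction could be supported by adding members to the enumeration and branches to the dispatch.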

Operation of the device according to the invention can be inferred from the above description.

During reproduction of a video stream, the user selects the objects of interest for displaying the object's features and/or the features of the service connected therewith for possible purchase.

In a preferred embodiment of the invention, the device for tracking objects Oi in a video stream S_(IN) operates in a transmission network 200 in which the created meta-information Mi is transmitted.

Device 1 can operate in a local area network (LAN), wide area network (WAN) or space network (WEB).

Preferably, the transmission environment is the World Wide Web and the protocol used is TCP/IP (the Internet protocol).

In other words, the invention relates to a meta-information Mi transmission system in a transmission network 200.

The transmission system 200 comprises at least one device 1 for tracking objects Oi in a video stream S_(IN), as previously described.

The processing unit 10 of device 1 comprises:

-   a selection module 13 configured for selecting video portions V comprising sequences of a predetermined object Oi, from the video stream S_(IN);
-   a module 14 for tracking positions P of the predetermined object Oi in the video portions V;
-   a module 15 for creating object's video streams, configured for creation of partial video streams S_(i) as a function of the tracked positions P, in which each partial video stream S_(i) represents one tracked object Oi;
-   a composition module 16 configured for composing the partial video streams S_(i) of said individual objects Oi into an output video stream S_(OUT);
-   a conversion module 21, configured for converting said output video stream S_(OUT) into meta-information Mi.

According to the invention, the meta-information Mi transmission system further comprises at least one destination device 101 configured for receiving the meta-information Mi.

In other words, object Oi tracked by the tracking device 1 becomes “taggable”, i.e. it can be extrapolated from the video stream in which it is shown.

Preferably, the meta-information comprises meta-data (meta-tags) in the format required for the web.

The meta-tags contained in the meta-information Mi can be transferred from the tracking device 1 to the destination device 101.

According to the invention, the destination device 101 comprises an interface 103 configured for showing the meta-information Mi.

The destination device 101 further comprises a memory 102 and a processing unit 104 for executing the instructions contained in said memory.

The interface 103 is configured for receiving an input signal I representative of a particular choice regarding displaying and/or features and/or additional functions of the objects Oi having determined the meta-information Mi.

In other words, the interface 103 is configured for showing the meta-information Mi as a function of an input signal I representative of a particular choice regarding displaying and/or features and/or additional functions of the objects Oi having determined the meta-information Mi.

As already said, signal I is representative of one of the displaying choices of:

-   information on the product;
-   information on the brand;
-   correlated products;
-   addition to a wish list.

Generally, signal I is representative of many different effects of man/video interaction.

According to the invention, the destination device 101 is configured for operating in turn as a transmitter of the received meta-information Mi to other destination devices 121.

In other words, distribution of the objects in the network is similar to that of YouTube and Flickr; the user can take the meta-information of the object and insert it in another site or blog, thus generating what is described as viral content.

Advantageously, according to the invention, it is the user himself/herself that directly promotes communication and sale of the object he/she has selected.

In addition, the invention promotes definition of the users' profile; in fact, the wish-list provides data on the user's tastes/interests.

Advantageously, according to the invention, it is possible to analyze the user's choices in a very detailed manner based on the statistics of user/contents interaction, for each object and each view.

Since each user shares his/her experience, tastes, interests, etc., with other people, he/she becomes a member of the system of the invention.

In other words, a personal wish list becomes a promotional channel to the social networks, for promoting goods/services of interest to the user. Furthermore, distribution of the selected objects by the users enables definition of a distributed e-commerce created by the users themselves.

In other words, the users, by selecting the object and for instance placing the code (meta-information) in their blog, enable the meta-information to circulate on the net; the meta-information tag remains active and causes the purchase functions to be opened again, thus bringing the users back from a generic location on the web to the purchase site; the link is transmitted from user to user and it is no longer necessary to know the origin site from which the interactive video, i.e. the meta-information, comes.

As those skilled in the art will readily understand from the above description, the users' posts are identified in the invention with the destination device 101, while the site from which the meta-information comes is identified in the invention with the tracking device 1.

With the system of the invention, distribution of information on the net is also valid for information not concerning commerce.

Therefore users can help in spreading cultural news and information, promoting a brand or an object the user likes, etc.

According to the invention the device 1 for tracking objects is associated with an operating environment configured for maintaining a trace of the tracking carried out on the video stream S_(IN).

In other words, the operating environment is configured for showing the list of the tracked objects and enabling direct browsing from the list to the positions in the video stream S_(IN) in which the object Oi is shown.

Preferably, the destination device 101 comprises a web server.

Preferably, the destination device 101 comprises a personal computer.

Preferably, the destination device 101 comprises a television set.

Preferably, the destination device 101 comprises a notebook computer.

Preferably, the destination device 101 comprises a mobile phone.

In other words, the destination device 101 comprises one or more of a web server, personal computer, TV, notebook, mobile phone or the like.

What is claimed is:
 1. A device for tracking objects in a video stream, said device comprising a processing unit in turn comprising: a selection module configured for selecting video portions, comprising sequences of a predetermined object, from said video stream; a module for tracking positions of said predetermined object in said video portions; a module for creating object's video streams, configured for creation of partial video streams as a function of said tracked positions, each said partial video stream representing one said tracked object; a composition module configured for composing said partial video streams of said individual objects into an output video stream.
 2. The device for tracking objects as claimed in claim 1, comprising a conversion module, configured for converting said output video stream into meta-information.
 3. The device for tracking objects as claimed in claim 2, wherein said meta-information comprises meta-data (meta-tags) in a format as required for the web.
 4. The device for tracking objects as claimed in claim 1, comprising a user interface configured for showing said output video stream as a function of a request mark signal for tracking said objects.
 5. The device for tracking objects as claimed in claim 4, wherein said user interface comprises a second display module configured for showing features of said objects as a function of an input signal representative of a particular choice regarding displaying and/or features and/or additional functions of said objects.
 6. The device for tracking objects as claimed in claim 1, wherein said tracking module comprises an interface for setting references representative of each said predetermined object in said video portions.
 7. The device for tracking objects as claimed in claim 6, wherein said tracking module comprises an association module configured for associating predetermined polygonal shapes with said object in said positions, starting from said references.
 8. The device for tracking objects as claimed in claim 7, comprising a storage module including a library of said predetermined polygonal shapes, said storage module being associated with said association module.
 9. The device for tracking objects as claimed in claim 1, comprising a first display module configured for displaying said partial video streams.
 10. The device for tracking objects as claimed in claim 1, comprising a conversion module configured for conversion of one source video stream into said video stream.
 11. A meta-information transmission system in a transmission network, comprising: at least one device for tracking objects in a video stream, said device including a processing unit, in turn comprising: a selection module configured for selecting video portions comprising sequences of a predetermined object, from said video stream; a module for tracking positions of said predetermined object in said video portions; a module for creating object's video streams, configured for creation of partial video streams as a function of said tracked positions, each said partial video stream representing one said tracked object; a composition module configured for composing said partial video streams of said individual objects into an output video stream; a conversion module, configured for converting said output video stream into meta-information; and at least one destination device configured for receiving said meta-information.
 12. The transmission system as claimed in claim 11, wherein said destination device comprises a user interface configured for showing said meta-information as a function of an input signal representative of a particular choice regarding displaying and/or features and/or additional functions of said objects having determined said meta-information.
 13. The transmission system as claimed in claim 12, wherein said destination device is configured for in turn operating as a transmitter of said meta-information received from said tracking device, to other destination devices.
 14. The transmission system as claimed in claim 13, wherein said meta-information comprises meta-data (meta-tags) in the required format for the web.
 15. The transmission system as claimed in claim 11, wherein said destination device comprises one or more of a web server, personal computer, TV, notebook, mobile phone or the like.